Joint learning of agents and graph embeddings in a conveyor belt control problem
Abstract
We address the problem of routing in a conveyor belt system using a multi-agent approach. Most airport baggage conveyor systems use routing algorithms based on manual simulation of conveyor behavior. This approach does not scale well, and recent machine learning research proposes solving the routing problem with reinforcement learning. We propose an approach in which the agents and the vector representations (embeddings) of the graph are learned jointly. Within this approach, we develop the QSDNE algorithm, which combines DQN agents with SDNE embeddings. We compare it with multi-agent routing algorithms that do not use joint learning. QSDNE proved effective at optimizing delivery time and energy consumption in conveyor systems, reducing mean delivery time by 6 %. The proposed approach can be applied to routing problems with complex path estimation functions and dynamically changing graph topologies, and the algorithm can be used to control conveyor belts at airports and in manufacturing workshops.
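To make the idea of joint learning concrete, here is a minimal NumPy sketch of one way a combined objective could look. It is an illustration under stated assumptions, not the QSDNE algorithm itself: the abstract does not say how the DQN and SDNE objectives are combined, so the weighted sum and the names `sdne_loss`, `td_loss`, `joint_loss`, `alpha`, `beta`, and `lam` are hypothetical. The SDNE part follows the standard first-order and second-order proximity terms, with the regularization term omitted.

```python
import numpy as np


def sdne_loss(adj, emb, recon, alpha=1.0, beta=5.0):
    """SDNE-style structural loss (regularization term omitted).

    adj   : (n, n) adjacency matrix of the conveyor graph
    emb   : (n, d) node embeddings (encoder output)
    recon : (n, n) reconstructed adjacency rows (decoder output)
    """
    # Second-order proximity: reconstruction error, where errors on
    # existing edges are weighted by beta > 1.
    penalty = np.where(adj > 0, beta, 1.0)
    second_order = np.sum(((recon - adj) * penalty) ** 2)

    # First-order proximity: connected nodes should have close embeddings.
    diff = emb[:, None, :] - emb[None, :, :]          # shape (n, n, d)
    first_order = np.sum(adj * np.sum(diff ** 2, axis=-1))

    return second_order + alpha * first_order


def td_loss(q_sa, reward, q_next, gamma=0.99):
    """Mean squared one-step temporal-difference error used in DQN.

    q_sa   : (b,) Q-values of the actions actually taken
    reward : (b,) rewards (e.g. negative delivery time or energy cost)
    q_next : (b, a) Q-values of all actions in the next state
    """
    target = reward + gamma * np.max(q_next, axis=-1)
    return np.mean((target - q_sa) ** 2)


def joint_loss(adj, emb, recon, q_sa, reward, q_next, lam=0.1, gamma=0.99):
    """ASSUMPTION: a weighted sum of both objectives (hypothetical)."""
    return td_loss(q_sa, reward, q_next, gamma) + lam * sdne_loss(adj, emb, recon)
```

In a setup like this, gradients of the routing loss would also flow into the embedding network, so the embeddings adapt to the routing task instead of being fixed in advance.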
Articles in current issue
- Pulse recording of dynamic holograms in bismuth silicate crystal in a broad wavelength range
- Hybrid endoscope with television and multispectral image processing for the internal organs cancer early diagnostics
- Modelling of a composite waveguide holographic display
- Application of infrared spectroscopy methods in studying compositions for paper sizing
- Distribution optimization method of pixel density by surveillance area
- Evaluation and development of a method for compensating the positioning error of computer numeric control equipment
- Compensation of output external disturbances for a class of linear systems with control delay
- Luminescence technique for studying the growth of AgInS2 quantum dots
- Peculiarities of pulsed laser deposition of thin InGaAsN films in an active background gas atmosphere
- Determination of the electron distribution in thin barrier AlGaAs/GaAs superlattices by capacitance-voltage profiling
- Spectral and kinetic properties of silver sulfide quantum dots in an external electric field
- Influence of nano-sized horizontal inhomogeneities on surface profiling by means of XPS
- Organic light-emitting diodes with new dyes based on coumarin
- Fabrication and characterization of hybrid composite of Al6082/SiC/rice husk powder using friction stir processing
- A multi-path secure routing for the detection of node capturing attack in wireless sensor network
- A method for documenting architectural solutions of computing platforms
- Improving out of vocabulary words recognition accuracy for an end-to-end Russian speech recognition system
- Method for monitoring the state of elements of cyber-physical systems based on time series analysis
- Application of the text wave model to the sentiment analysis problem
- Automated evaluation of ECG parameters during the COVID-19 pandemic
- Multi-agent adaptive routing by multi-head-attention-based twin agents using reinforcement learning
- Simulation of radiative transfer in gas-liquid foams
- The effect of signal-to-noise ratio value on the error in measuring acoustic emission parameters: statistical assessment
- Simulating the process of steady-state thermoreflectance for measuring the thermal conductivity of materials
- Modeling and simulation of one- and two-row six-bladed ducted fans
- Differential-difference model of heat transfer in solids using the method of parametric identification